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Abstract. Ariola and Felleisen's call-by-need A-calculus replaces a vari- 
able occurrence with its value at the last possible moment. To sup- 
port this gradual notion of substitution, function applications — once 
established — are never discharged. In this paper we show how to trans- 
late this notion of reduction into an abstract machine that resolves vari- 
able references via the control stack. In particular, the machine uses the 
static address of a variable occurrence to extract its current value from 
the dynamic control stack. 



1 Implementing Call-by-need 

Following Plotkin T, Ariola and Felleisen characterize the by-need A-calculus 
as a variant of /?: 

{Xx.E[x]) V = {Xx.E[V]) V , 

and prove that a machine is an algorithm that searches for a (generalized) value 
via the leftmost-outermost application of this new reduction [2] . 
Philosophically, the by-need A-calculus has two implications: 

1. First, its existence says that imperative assignment isn't truly needed to 
implement a lazy language. The calculus uses only one-at-a-time substitu- 
tion and does not require any store-like structure. Instead, the by-need /3 
suggests that a variable dereference is the resumption of a continuation of 
the function call, an idea that Garcia et al. [3] recently explored in detail by 
using delimited control operations to derive an abstract machine from the 
by-need calculus. Unlike traditional machines for lazy functional languages, 
Garcia et al.'s machine eliminates the need for a store by replacing heap 
manipulations with control (stack) manipulations. 

2. Second, since by-need /? does not remove the application, the binding struc- 
ture of programs — the association of a function parameter with its value — 
remains the same throughout a program's evaluation. This second connection 
is the subject of our paper. This binding structure is the control stack, and 
thus we have that in call-by-need, static addresses can be resolved in the 
dynamic control stack. 
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Our key innovation is the CK+ machine, which refines the abstract machine 
of Garcia et af. by making the observation that when a variabie reference is in 
focus, the location of the corresponding binding context in the dynamic control 
stack can be determined by the lexical index of the variable. Whereas Garcia 
et al.'s machine linearly traverses their control stack to find a specific binding 
context, our machine employs a different stack organization where indexing can 
be used instead of searching. Our machine organization also simplifies the hygiene 
checks used by Garia et al., mostly because it explicitly maintains Garcia et al.'s 
"well-formedness" condition on machine states, instead of leaving it as a side 
condition. 

The paper starts with a summary of the by-need A-calculus and the abstract 
textual machine induced by the standard reduction theorem. We then show how 
to organize the machine's control stack so that when the control string is a 
variable reference, the machine is able to use the lexical address to compute the 
location of the variable's binding site in the control stack. 

2 The Call-by-need A-calculus, the de Bruijn Version 

The terms of the by-need A-calculus are those of the A-calculus [3], which we 
present using de Bruijn's notation i.e., lexical addresses replace variables: 

M ::= n \ A.M | M M 

where n g N. The set of values is just the set of abstractions: 

V ::= X.M 

One of the fundamental ideas of call-by-need is to evaluate the argument in 
an application only when it is "needed," and when the argument is needed, to 
evaluate that argument only once. Therefore, the by-need calculus cannot use 
the /3 notion of reduction because doing so may evaluate the argument when it is 
not needed, or may cause the argument to be evaluated multiple times. Instead, 
/3 is replaced with the deref notion of reduction: 

i\.E[n]) V need {\.E[V]) V, A binds n deref 

The deref notion of reduction requires the argument in an application to be a 
value and requires the body of the function to have a special shape. This special 
shape captures the demand-driven substitution of values for variables that is 
characteristic of call-by-need. In the deref notion of reduction, when a variable 
is replaced with the value V, some renaming may still be necessary to avoid 
capture of free variables in V, but for now, we assume a variant of Barendregt's 
hygiene condition for de Bruijn indices and leave all necessary renaming implicit. 
Here is the set of evaluation contexts E: 



E ::= 



[]\E M \ (X.E) M I iX.E'[n]) E 
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Like all contexts, an evaluation context is an expression with a hole in the place 
of a subexpression. The first evaluation context is an empty context that is just a 
hole. The second evaluation context indicates that evaluation of applications pro- 
ceeds in a leftmost-outermost order. This is similar to how evaluation proceeds 
in the by-name A-calculus [1]. Unlike call- by- name, however, call-by-need defers 
dealing with arguments until absolutely necessary. It therefore demands evalua- 
tion within the body of a let-like binding. The third evaluation context captures 
this notion. This context allows the deref notion of reduction to search under 
applied As for variables to substitute. The fourth evaluation context explains 
how the demand for a parameter's value triggers and directs the evaluation of 
the function's argument. In the fourth evaluation context, the visible A binds n 
in X.E'[n\. This means that there are n additional A abstractions in E' between 
n and its binding A. 

To make this formal, let us define the function Z\ :£'—!> N as: 

zi([]) = AiiX.E'in]) E) ^ AiE) 

A{E M) = A{E) AiiX.E) M) = A{E) + 1 

With Z\, the side condition for the fourth evaluation context is n = A{E'). 

Unlike /?, deref does not remove the argument from a term when substitu- 
tion is complete. Instead, a term (A.M) N is interpreted as a term M and an 
environment where the variable (index) bound by A is associated with N . Since 
arguments are never removed from a by-need term, reduced terms are not nec- 
essarily values. In the by-need A-calculus, reductions produce "answers" a (this 
representation of answers is due to Garcia et al. [3]): 

a ::— A[V] answers 
A [ ] I {\.A) M answer contexts 

Answer contexts A are a strict subset of evaluation contexts E. 

Since both the operator and the operand in an application reduce to answers, 
two additional notions of reduction are needed: 

iX.A[V]) M N need {X.A[V N]) M assoc-L 
iX.E[n]) iiX.A[V]) M) need {X.A[{X.E[n]) V]) M, if A{E) ^ n assoc-R 

As mentioned, some adjustments to de Bruijn indices are necessary when 
performing substitution in A-calculus terms. For example, in a deref reduction, 
every free variable in the substituted V must be incremented by A{E) + 1. Oth- 
erwise, the indices representing free variables in V no longer count the number 
of As between their occurrence and their respective binding As. Similar adjust- 
ments are needed for the assoc-L and assoc-R reductions, where subterms are 
also pulled under As. 

Formally, define a function t that takes three inputs: a term M , an integer 
X, and a variable (index) m, and increments all free variables in M by x, where 
a free variable is defined to be an index n such that n > m. In this paper, we 
use the notation Mf^- Here is the formal definition of t: 
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nr^ = n + x,iin>m [M iV)t^ - {{MTJ (TVt?^)) 

nt^, = n, if n < m A. A/tf„ = A. {Mr„^+i) 

Using the t function for index adjustments, the notions of reduction are: 

{\.E[n]) V need {\.E[V-\^^^^'^]) V, if A{E) = n deref 

iX.A[V]) M N need {X.A[V (A^t^f *^^^)]) M assoc-L 

{X.E[n]) M) need {X.A[{{X.E[n])t^^^'^^) V]) M, if A{E) = n 

assoc-ij 

It is acceptable to apply the A function to A because A is a subset of E. 



3 Standard Reduction Machine 



In order to derive an abstract machine from the by-need A-calculus, Ariola and 
Felleisen prove a Curry-Feys-style Standardization Theorem. Roughly, the the- 
orem states that a term M reduces to a term in a canonical manner if M 
reduces to N in the by-need calculus. 

The theorem thus determines a state machine for reducing programs to an- 
swers. The initial state of the machine is the program, the collection of states is 
all possible programs, and the final states are answers. Transitions in the state 
machine are equivalent to reductions in the calculus: 

E[M] 1 — ^need E[M'], if M need M' 

where E represents the same evaluation contexts that are used to define the 
demand-driven substitution of variables in the deref notion of reduction. 

The machine is deterministic because all programs M satisfy the unique de- 
composition property. This means that M is either an answer or can be uniquely 
decomposed into an evaluation context and a redex. Hence, we can use the state 
machine transitions to define an evaluator function; 



eval„eed(M) = 



a, if M I — »„eed a 

_L, if for all M l »„eed N, N l >need L 



Lemma 1. evalneed is a total function. 

Proof. The lemma follows from the standard reduction theorem [2]. 



4 The CK+ Machine 



A standard reduction machine specifies evaluation steps at a high-level of ab- 
straction. Specifically, at each evaluation step in the machine, the entire program 
is partitioned into an evaluation context and a redex. This repeated partition- 
ing is inefficient because the evaluation context at any given evaluation step 
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tends to share a large common prefix with the evaluation context in the previ- 
ous step. To eliminate this inefhciency, Felleisen and Friedman propose the CK 
machine [6] Chapter 6] , an implementation for a standard reduction machine of 
a call-by-value language. Consider the following call-by- value evaluation: 

{{Xw.w) {{Xx.{x {{Xy.y) Xz.z))) Xx.x)) 
I — >y {{Xw.w) {{Xx.x) {{Xy.y) Xz.z) )) 
I — >v {{Xw.w) {{Xx.x) Xz.z)) 
I — >v {{Xw.w) Xz.z) 
I — Xz.z 



In each step, the /3„ redex is underlined. The evaluation contexts for the first and 
third term are the same, {{Xw.w) [ ]), and it is contained in the evaluation context 
for the second term, {{Xw.w) {{Xx.x) [ ])). Although the evaluation contexts in 
the first three terms have repeated parts, a standard reduction machine for the 
call-by-value calculus must re-partition the program at each evaluation step. 

The CK machine improves upon the standard reduction machine for the 
by-value A-calculus by eliminating redundant search steps. While the standard 
reduction machine uses whole programs as machine states, a state in the CK ma- 
chine is divided into separate subterm (C) and evaluation context (K) registers. 
More precisely, the C in the CK machine represents a control string, i.e., the 
subterm to be evaluated, and the K is a continuation, which is a data structure 
that represents an evaluation context in an "inside-out" manner. The original 
program can be reconstructed from a CK machine state by "plugging" the ex- 
pression in the C subterm register into the context represented by K. When the 
control string is a redex, the CK machine can perform a reduction, just like the 
standard reduction machine. Unlike the standard reduction machine though, the 
CK machine still remembers the previous evaluation context in the context reg- 
ister and can therefore resume the search for the next redex from the contractum 
in C and the evaluation context in K. 



4.1 CK+ Machine States 

We introduce the CK+ machine, a variant of the CK machine, for the by-need 
A-calculus. The CK-I- machine is also a modification of the abstract machine 
of Garcia et al. [3]. The machine states for the CK-I- machine are specified in 
figure [TJ The core CK-I- machine has three main registers, a control string (C), 
a "renaming" environment (i?), and a continuation stack {K). 

In figure [TJ the . . . notation means "zero or more of the preceeding element" 
and in the stack K, . . .||, the partial stack frame k is the top of the stack. 
The initial CK-|- machine state is (M, ( ), ||int||), where M is the given program, 
( ) is an empty renaming environment, and ||mt || is a stack with just one element, 
an empty frame. 
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S,T:~ {C,R,K) 
C :■- M 



control strings 
renaming environments 
offsets 



machine states 



R :~ {i, ...) 



i e N 
K:~ life, 7^, ...|| 
K :■- (bind M R k) 
k :■- mt 1 (arg M R k) \ (op K k) 



complete stack frames 
partial stack frames 



continuation stacks 



Fig. 1. CK+ machine states. 



4.2 Renaming Environment 

As mentioned in section [2l substitution requires some form of renaming, which 
manifests itself as lexical address adjustments when using a de Bruijn repre- 
sentation of terms. Instead of adjusting addresses directly, the CK-I- machine 
delays the adjustment by keeping track of offsets for all free variables in the con- 
trol string in a separate renaming environment. The delayed renaming is forced 
when a variable occurrence is evaluated, at which point the offset is added to 
the variable before it is used to retrieve its value from the control stack. 

Here we use lists for renaming environments and the offset corresponding to 
variable n, denoted i?(n), is the n-th element in R (0-based). The : function is 
cons, and the function M'^R applies a renaming environment i? to a term M, 
yielding a term like M except with appropriately adjusted lexical addresses: 



Because the CK-t- machine uses renaming environments, the t function from 
section [2] is replaced with an operation on R. When the machine needs to incre- 
ment all free variables in a term, it uses the © function to increment all offsets in 
the renaming environment that accompanies the term. The notation R(Bx means 
that all offsets in renaming environment R are incremented by x. Thus, the use 
of indices in place of variables enables hygiene maintenance through simple in- 
crementing and decrementing of the indices. As a result, we have eliminated the 
need to keep track of the "active variables" that are present in Garcia et al.'s 
machine O Section 4.5]. 

4.3 Continuations and the Continuation Stack 

Like the CK machine, the CK-t- machine represents evaluation contexts as con- 
tinuations. The [ ] context is represented by the mt continuation. An evaluation 



MH ) 



M 



n-^R. 
{X.M)<=R 
{M N)-^R 



n + R{n) 
X.{M<^{0:R)) 

{{M-^R) {N<^R)) 
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context E[{[ ] N)] is represented by a continuation (arg M R k) where k repre- 
sents E and {M<^R) = N. An evaluation context i?[(A.[ ]) A^] is represented by 
a continuation (bind M R k) where k represents E and {M<^R) = N. Finally, 
the E[{X.E'[n]) [ ]] context is represented by an (op K k) continuation. The E' 
under the A in the evaluation context is represented by the nested K stack in 
the continuation and the E surrounding the evaluation context corresponds to 
the k in the continuation. The op continuation does not need to remember the n 
variable in the evaluation context because the variable can be derived from the 
length of K. 

The contents of the K register represent the control stack of the program 
and we refer to an element of this stack as a frame. The key difference between 
the CK-|- machine and Garcia et al.'s machine is in the organization of the 
frames of the stack. Instead of a flat list of frames like in Garcia et al.'s machine, 
our control stack frames are groups of nested continuations of a special shape. 
Thus we also call our control stack a "continuation stack." We use two kinds of 
frames, partial and complete. The first frame in the continuation stack is always 
a partial one, while all others are complete. The outermost continuation of a 
complete frame is a bind and all other nested pieces of a complete frame are op, 
arg, or mt. Thus, not counting the first partial frame, there is exactly one frame 
in the control stack for every bind continuation in the program. As a result, the 
machine can use a variable (lexical address) n to find the bind corresponding to 
that variable in the control stack. 

4.4 Maintaining the Continuation Stack 

Each frame of the control stack, with the exception of the top frame, has the 
shape (bind M R fc), where fc is a partial frame that contains no additional 
bind frames. In order for the continuation stack to maintain this invariant, GK-|- 
machine transitions must adhere to two conditions: 

1. When a machine transition is executed, only the top partial frame of the 
stack is updated unless the instruction descends under a A. 

2. If a machine transition descends under a A, the partial frame on top of the 
stack is completed and a new mt partial frame is pushed onto the stack. 

Essentially, the top frame in the stack "accumulates context" until a A is 
encountered, at which time the top partial frame becomes a complete frame. 
Maintaining evaluation contexts for the program in this way implies a major 
consequence for the CK-|- machine: 

when the control string is a variable n, then the binding for n is (n -|- 
R{n) + 1) stack frames away. 

4.5 Relating Machine States to Terms 

Figure [2] defines the ip function, which converts machine states to A-terms. It uses 
the A/<^i? function to apply the renaming environment to the control string and 
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<pi{M,R,K)) = K[M^R] 




M 

fc[(M_(iV^_R))] 
k[{X.K[len{K) ~ 1]) M] 
k[{X.M) (N^R)] 



■ ■ ■ [Klk[M]]] 



Fig. 2. (p converts CK+ machine states to A-calculus terms. 



then uses a family of "plug" functions, dubbed ■[■], to plug the renamed control 
string into the hole of the context represented by the continuation component of 
the state. Figure [2] also defines these plug functions, where K[M] yields the term 
obtained by plugging M into the context represented by K, and A'[Af] yields 
the term when M is plugged into the context represented by the continuation 
stack K. 

4.6 CK+ Machine State Transitions 

Figure [3] shows the first four state transitions for the CK+ machine. The -H- 
notation indicates an "append" operation for the continuation stack. Since the 
purpose of the CK+ machine is to remember intermediate states in the search 
for a redex, three of the first four rules are search rules. They shift pieces of the 
control string to the K register. For example, the [shift- arg] transition shifts the 
argument of an application to the K register. 



ck + 



{(MN),R,\\k, K, ...||) 



[shift-arg] 

(M, R, Wiars N Rk), K, ...||> 



{X.M,R,\\{^rgN R' k), K, ...||> 



[descend- A] 

{M,0:R,\\mt, (hind NR'k), K, ...|[) 



[lookup- arg] 



(n, R, K++\\{bind N R' k), K, ...||) 
where len(K) = n + R{n) + 1 




(Kii,||(op^fc), K, ...||) 



[resume] 

{V,R' ,K++\\{\>±-nAV Rk), K, ...\\) 
where R' = R® len(K) 



Fig. 3. State transitions for the CK+ machine. 
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The [descend- A] transition shifts a A binding to the K register. When the 
control string in the CK-|- machine is a A abstraction, and that A is the operator 
in an application term — indicated by an arg frame on top of the stack — the body 
of the A becomes the control string; the top frame in the stack is updated to be 
a complete bind frame; and a new partial mt frame is pushed onto the stack. 

The [descend- A] instruction also updates the renaming environment which, as 
mentioned, is a list of numbers. There is one offset in the renaming environment 
for each bind continuation in the control stack and the offsets in the renaming 
environment appear in the same order as their corresponding bind continuations. 
When the machine descends into a A expression, a new bind continuation is 
added to the top of the control stack so a new corresponding offset is also added 
to the front of the renaming environment. Since offsets are only added to the 
renaming environment when the machine goes under a A, whenever a variable n 
(a lexical address) becomes the control string, its renaming offset is located at 
the n-th position in the renaming environment. A renaming offset keeps track 
of the relative position of a bind continuation since it was added to the control 
stack so a [descend- A] instruction adds a offset to the renaming environment. 

When the control string is a variable n, the binding for n is accessed from 
the continuation stack by accessing the {n -\- R{n) + l)-th frame in the stack. The 
[lookup-arg] instruction moves the argument that is bound to the variable into 
the control string register. The op frame on top of the stack is updated to store 
all the frames inside the binding A, in the same order that they appear in the 
stack. Using this strategy, the machine can "jump" back to this context after 
it is done evaluating the argument. For a term {X.E[n]) M, this is equivalent 
to evaluating M while saving E and then returning to the location of n after 
the argument M has been evaluated. Note that the [lookup-arg] transition does 
not perform substitution. The argument has been copied into the control string 
register, but it has also been removed from the continuation stack register. 

When the frame on top of the stack is an op, it means the current control 
string is an argument in an application term. When that argument is a value, 
then a redex has been found and the value should be substituted for the variable 
that represents it. The [resume] rule is the only rule in figure [3] that performs 
a reduction in the sense of the by-need calculus. It is the implementation of 
the deref notion of reduction from the calculus. Specifically, the [resume] rule 
realizes this substitution by restoring the frames in the op frame back into the 
continuation stack as well as copying the value into a new bind frame. The result 
is nearly equivalent to the left hand side of the [lookup-arg] rule except that the 
argument has been evaluated and has been substituted for the variable. 

Since the [resume] rule performs substitution, it must also update the renam- 
ing environment. Hence, the distance between V and its binding frame is added 
to every offset in the renaming environment R, as indicated by i? leii(K). In 
other words, each offset in the environment is being incremented by the number 
of bind continuations that are added to the control stack. 

In summary, the four rules of figure [3] represent intermediate partitions of the 
program into a subterm and an evaluation context before a partitioning of the 
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program into an evaluation context and a deref redex is found. As a result, the 
CK+ machine does not need to repartition the entire program on every machine 
step and is therefore more efScient than standard reduction. To complete the 
machine now, we must make it deal with answers. 

4.7 Dealing with Answers 

The CK+ machine described so far has no mechanism to identify whether a 
control string represents an answer. The by-need calculus, however, assumes 
that it is possible to distinguish answers from terms on several occasions, one of 
which is the completion of evaluation. To efficiently identify answers, the CK-I- 
machine uses a fourth "answer" register. The CK+ machine identifies answers 
by searching the continuation stack for frames that are answer contexts. To 
distinguish answer contexts from evaluation contexts, we characterize answer 
contexts in figure ID A final machine state has the form {^V,R,\\ || ,^). 



S, T :■- {C\ R, K) I (V, R, ||F, . . . , K, . . .|| , ^) machine states 

F ::= (bind M R mt) answer (complete) frame 

A::— ||mt, F, ...|| answer stacks 

Fig. 4. CK+ machine answer states. 



When the control string is a value V and mt is the topmost stack frame, then 
some subterm in the program is an answer. In this situation, the mt frame in the 
stack is followed by an arbitrary number of F frames. The machine searches for 
the answer by shifting mt and F frames from the continuation stack register to 
the answer register. The machine continues searching until either a K frame is 
seen or the end of the continuation stack is reached. If the end of the continuation 
stack is reached, the entire term is an answer and evaluation is complete. 

The presence of a, K frame means an assoc-L or an assoc-R redex has been 
found. In order to implement these shifts, the CK+ machine requires four ad- 
ditional rules for handling answers, as shown in figure [5] The [ans-searchl] rule 
shifts the mt frame to the answer register. The [ans-search2] rule shifts F frames 
to the answer register. The [assoc-L] rule and the [assoc-R] rule roughly corre- 
spond to the assoc-L and assoc-R notions of reduction in the calculus, respec- 
tively. The rules are optimized versions of corresponding notions of reduction 
in the calculus because the transition after the reduction is always known. The 
[assoc-L] machine rule performs the equivalent of an assoc-L reduction in the 
calculus, followed by a [descend- A] machine transition. The [assoc-R] machine 
rule performs the equivalent of an assoc-R reduction in the calculus, followed by 
a [resume] machine transition. 
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cfc + 



{V,R,\\mt, K, ...||> 




{V,R,\\F', K, ...\\,\\mt, F, ...||) 




[assoc-L] 



{X.M',R, \\{hindM R' (arg iV R" k)), K, . . .|| , |[mt, F, . . .||) 



{M',0:R,\\mt, (bind iV i?'" mt), F, {hindM R' k), K, ...||> 

where R'" = R" ® len([|F, . . .||) + 1 




{V,R",K'+i'\\{hindV Rmt), F, . . . {hindM R' k), K, ...||) 
where K' = K®len{\\F, ...||) + 1, and R" = R®len{K') 



Fig. 5. Transitions of the CK+ machine that handle answer terms. 



In figure [SJ the function © has been extended to a family of functions de- 
fined over renaming environments, continuation stacks, and stack frames: R(Bx 
increments every offset in the renaming environment R hy x and the function 
K (B X increments every offset in every renaming environment in every frame in 
K by X. The function len(||F, . . .||) returns the number of frames in . . .||. 
Maintaining the offsets in this manner is equivalent to obeying Garcia et al.'s 
"well-formedness" condition on machine states. 

4.8 Correctness 

Correctness means that the standard reduction machine and the CK-I- machine 
define the same evaluator functions. Let us start with an appropriate definition 
for the CK-|- machine: 



Recall that the function ip converts CK-f- machine states to A-calculus terms 
(figure [5]). Here, ip has been extended to handle "answer" machine states: 




ck+ T 



ip{{M, R, K,A))^ K[A[M^R]] 
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The desired theorem says that the two eval functions are equal 
Theorem 1. evalneed ^ svalck+- 

To prove the theorem, we first estabhsh some auxihary lemmas on the totahty 
of evalck+ and the relation between CK+ transitions and standard reduction 
transitions. 

Lemma 2. evalck+ is a total function. 

Proof. The lemma is proved via a subject reduction argument. □ 
The central lemma uses (p to relate CK+ machine transitions to reductions. 
Lemma 3. For all CK+ machine states S and T , if S i — >ck+ T , then either 

tp{S) I ^need V{T) Or ip{S) = ip{T). 

Proof. We proceed by case analysis on each machine transition, starting with 
[resume]. Assume 

{V,R,\\{opK k), K, ...||) 

{V,R®len{K),K++\\{hindV Rk), K, ...||) , 

then let 

Ah^ipi{V,R,\\iopK k), K, ...\\)) 

= \\K, ...\\ [k[{\.K[-len{K)~l]){V^R)]] 
M2 = 99((T^,i?©len(i?),i?++||(bindFi?fc), ...[!)) 

= ...\\[k[{\.K[V<^{R®ler^{K))]){V^R)]] . 

Since Mi is a standard deref redex, we have: 

\\K, ...\\ [k[{\.K[leii{K) - 1]) {V^m ^need 

\\K, ...\\m\.K[{V^R)\T'^^'>]){y^R)]] 

To conclude that Mi i — s^need M2 by the deref notion of reduction, we need to 
show: 

(F^i?)tr^'^^- y^{R ® len(i^)) 

LemmalDproves the general case for this requirement. Therefore, we can conclude 
that Ml I — 5- need M2. The proofs for [assoc-L] and [assoc-R] are similar. 

As for the remaining instructions, they only shift subtcrms/contexts back 
and forth between registers, so the proof is a straightforward calculation. □ 

Lemma 4. Vi?, i?2, where R = R1++R2 and m = len{Ri): 

(M^i?)t™= M^{Ri+M.R2 © x)) 
Proof. By structural induction on M . □ 
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Using lemma [31 the argument to prove our main theorem is straightforward. 

Proof (of Theorem]^. We show evalcfe+(A/) — a eval„eed(-^^) = a- 

The left-to-right direction follows from the observation that for all CK-|- 

machine starting states S and final machine states S final, if S i — »ck+ S final, 

then M i — »need «, where f{S final) = a. This is proved using lemma |3] and 

induction on the length of the i — »-ck+ sequence. 

The other direction is proved by contradiction. Assume evalneed(Af) = a 

_L and evalc/c+(M) ^ a. Since evalcfc+ is a total function, either: 

1. (M,(),||mt||)^ 

cfc+ S final 5 

where (p{S final) ^ a, or 

2. the reduction of (M, ( ), ||mt||) diverges. 

It follows from the left-to-right direction of the theorem that, in the first 
case, evalneed(Af) = fiS final) 7^ a, and in the second case, evalneed(A/) = -L. 
However, eval„eed (-^^^) — ol was assumed and evalneed is a total function, so a 
contradiction has been reached in both cases. Since none of the cases are possible, 
we conclude that if evalneed(-W) = a, then evalcfc+(M) = a. □ 

5 Stack Compacting 

Because the by-need A-calculus does not substitute the argument of a function 
call for all occurrences of the parameter at once, applications are never removed. 
In the CK-|- machine, arguments accumulate on the stack and remain there 
forever. For a finite machine, an ever-growing stack is a problem. In this section, 
we explain how to compact the stack. 

To implement a stack compaction algorithm in the CK-I- machine, we in- 
troduce a separate SC machine which removes all unused stack bindings from 
a CK-I- machine state. Based on the SC machine, the CK-I- machine can be 
equipped with a non-deterministic [sc] transition; 

(Af, R, k) (M, i?', k') [sc] 

where((FV A/i?0),(M,i?),i^,|| ||) {TAM,R'),\\ \\,k') 

Figure [S] presents the SC machine. In this figure, FV refers to a family of 
functions that extracts the set of free variables from terms, stack frames, and 
continuation stacks. The function FV takes a term M , a renaming environment 
R and a variable m, and extracts free variables from M, where a free variable 
is defined to be all n such that n + R{n) > m. The function FV is similarly 
defined for stack frames and continuation stacks. In addition, J- — denotes 
the set obtained by decrementing every element in T by one. Finally, k@k 
represents a frame merged appropriately into a continuation stack. For exam- 
ple, K, . . . , (bind M R k")\\ @k = \\k', K, . . . , (bind M R k")@k\\, where 
(bind M R k")@k = (bind M R k"@k), and so on, until finally int@fc = k. 

Also in figure [6l tt denotes a family of functions that adjusts the offsets 
in renaming environments to account for the fact that a A has been removed 
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■— i^J-, [M, R),IC, K) machine states 

:= {n, . . .} set of free variables 

:= ||fc, K, . . .\\ I \\K, ...|| partial stacks 



[shift-partial- frame] 

{F,{M,R),\\k, K, ...||,|| II) (^U(FVfcO),(A/,i?),||7^, ...||,||fc||) 

[shift-complete- frame] 

{T,{M,R),\\K', K, ...||,||fc, K", ...||) {T',iM,R),\\K, ...\\,\\k, K" , 7^'||) 
QeT where T' = [T—) U (FV K' 0) 

[pop-frame] 

(-F, (M,7?),||(bindAf iJfc), K, ...\\,K) {j'—,{M,R'),\\K, ...\\,K'@k) 

^ T where R' = (M, i?)nrel(K)-i 

and K' = Kn:l,^,_, 



Fig. 6. The SC machine. 



from the term. If a variable n refers to a bind stack frame that is deeper in the 
stack than the frame that is removed, then the offset for that variable needs to 
be decremented by one. A variable n refers to a bind that is deeper than the 
removed frame if n + R{n) is greater than the depth of the removed frame. The 
tt function can be applied to renaming environments directly or to continuation 
stacks or stack frames that contain renaming environments. We use the notation 
(M, i?)tt? to mean that the offsets in R are incremented by x for all variables n 
in M where n+R{n) > £. The result of (M, i?)ttf is a new renaming environment 
with the adjusted offsets. The notation -R'tt^ means that the offsets for all M 
and R pairs in the continuation stack K are adjusted. K^^^ evaluates to a new 
continuation stack that contains the adjusted renaming environments. 



6 Related Work and Conclusion 

The call- by- need calculus is due to Ariola et al. |2I7I8| . Garcia et al. [3] derive an 
abstract machine for Ariola and Felleisen's calculus and, in the process, uncover 
a correspondence between the by-need calculus and delimited control operations. 
Danvy et al. jHI derive a machine similar to Garcia et al. by applying "off-the- 
shelf" transformations to the by-need calculus. 

Our paper has focused on the binding structure of call-by-need programs 
implied by Ariola and Felleisen's calculus. We have presented the GK-|- ma- 
chine, which restructures the control stack of Garcia et al.'s machine, and we 
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have shown that lexical addresses can be used to directly access binding sites for 
variables in this dynamic control stack, a first in the history of programming lan- 
guages. The use of lexical addresses has also simplified hygiene maintenance by 
eliminating the need for the set of "active variables" that is present in Garcia et 
al.'s machine states. In addition, we show how using indices in place of variables 
allows for simple maintenance of Garcia et al.'s "well- formed" machine states. 
Finally, we have presented a stack compaction algorithm, which is used in the 
CK4- machine to prevent stack overflow. The compaction algorithm used in this 
paper is a restriction of the more general garbage collection notion of reduction 
of Felleisen and Hieb ^Oj and is also reminiscent of Kelsey's work ^llj . 

Acknowledgments. Thanks to the anonymous reviewers for their feedback 
and to Daniel Brown for inspiring discussions. 
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